Combining Background Knowledge and Learned Topics

نویسندگان

  • Mark Steyvers
  • Padhraic Smyth
  • Chaitanya Chemudugunta
چکیده

Statistical topic models provide a general data-driven framework for automated discovery of high-level knowledge from large collections of text documents. Although topic models can potentially discover a broad range of themes in a data set, the interpretability of the learned topics is not always ideal. Human-defined concepts, however, tend to be semantically richer due to careful selection of words that define the concepts, but they may not span the themes in a data set exhaustively. In this study, we review a new probabilistic framework for combining a hierarchy of human-defined semantic concepts with a statistical topic model to seek the best of both worlds. Results indicate that this combination leads to systematic improvements in generalization performance as well as enabling new techniques for inferring and visualizing the content of a document.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Background Knowledge and Learned Topics Address for Correspondence

Statistical topic models provide a general data-driven framework for automated discovery of high-level knowledge from large collections of text documents. While topic models can potentially discover a broad range of themes in a data set, the interpretability of the learned topics is not always ideal. Human-defined concepts, on the other hand, tend to be semantically richer due to careful select...

متن کامل

بررسی دیدگاه دندانپزشکان در زمینه آموزش مباحث مربوط به مدیریت مطب دندانپزشکی

Background and Aims: The aim of the present study was to investigate self-perceived need to learn practice management and self reported knowledge in this regard among dentists. Materials and Methods: A group of five academic staff members of Community Oral Health Department (Tehran Dental School) and four dentists formed an expert panel to develop the content of the course. This group develope...

متن کامل

Combining feature norms and text data with topic models.

Many psychological theories of semantic cognition assume that concepts are represented by features. The empirical procedures used to elicit features from humans rely on explicit human judgments which limit the scope of such representations. An alternative computational framework for semantic cognition that does not rely on explicit human judgment is based on the statistical analysis of large te...

متن کامل

Combining Gaussian Processes and Conventional Path Planning in a Learning from Demonstration Framework

Today, robots are already able to solve specific tasks in laboratory environments. Since everyday environments are more complex, the robot skills required to solve everyday tasks cannot be known in advance and thus not be programmed beforehand. Rather, the robot must be able to learn those tasks being instructed by users without any technical background. Hence, Learning from Demonstration (LfD)...

متن کامل

Exploiting Connectivity for Case Construction in Learning by Reading

One challenge faced by cognitive systems is how to organize information that is learned by reading. Analogical reasoning provides a method for immediately using learned knowledge, and analogical generalization potentially provides a means to integrate knowledge across multiple sources. To use analogy requires organizing information into effective cases. This paper argues that using connectivity...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Topics in cognitive science

دوره 3 1  شماره 

صفحات  -

تاریخ انتشار 2011